World Model on Million-Length Video and Language with RingAttention by ruycer.eth2 🥝 • 2y
This paper was submitted in the AI channel on forecaster. I find it fascinating because it has the potential to learn from video, which opens up a practically endless source of good data. You can imagine a system learning from video of the world and building a model of it.
